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ABSTRACT 

After reionization, emission in the 21 cm hyperfine transition provides a direct probe of neutral hy- 
drogen distributed in galaxies. Different from galaxy redshift surveys, observation of baryon acoustic 
oscillations in the cumulative 21 cm emission may offer an attractive method for constraining dark 
energy properties at moderate redshifts. Keys to this program are techniques to extract the faint cos- 
mological signal from various contaminants, such as detector noise and continuum foregrounds. In this 
paper, we investigate the possible systematic and statistical errors in the acoustic scale estimates us- 
ing ground-based radio interferometers. Based on the simulated 21 cm interferometric measurements, 
we analyze the performance of a Fourier-space, light-of-sight algorithm in subtracting foregrounds, 
and further study the observing strategy as a function of instrumental configurations. Measurement 
uncertainties are presented from a suite of simulations with a variety of parameters, in order to have 
an estimate of what behaviors will be accessible in the future generation of hydrogen surveys. We 
find that 10 separate interferometers, each of which contains ~ 300 dishes, observes an independent 
patch of the sky and produces an instantaneous field-of-view of ~ 100 deg 2 , can be used to make a 
significant detection of acoustic features over a period of a few years. Compared to optical surveys, the 
broad bandwidth, wide field-of-view and multi-beam observation are all unprecedented capabilities of 
low-frequency radio experiments. 

Subject headings: cosmology: theory — large-scale structure of universe — methods: data analysis — 
radio lines: general — techniques: interferometric 



1. INTRODUCTION 

The length scale of baryon acoustic oscillation (BAO) 
provides a "standard ruler" in the Universe, mak- 
ing it one of the most powerful probes of cosmic ex- 
pansion history. This characteristic scale is the dis- 
tance traveled by sound waves in the photon-baryon 
plasma before recombination, and has been detected 
both in t he cosmic microwave background (CMB) at 
z ~ 1000 (Hinsha w et al. II2009D and in galaxy surveys 
at z ~ (Eis enstein et al. II2005D . Recently, much ef- 
fort has been expended to establish feasible schemes 
of measuring the BAO features accurately at a vari- 
ety of redshifts. It is commonly believed that a com- 
bined analysis of BAO measurements and other inde- 
pendent observables can substantially improve the cur- 
rent constraints on the equatio n of state of dark energy 
and its time depend e nce Ce.g. [Seo k, Eisenstein I [20031: 
Hu fc Haiman I 120031: iZhan et al. I 120081: iZhang et al. I 
200a iSamushia fc Ratra Il2009t iWang et al. 1120111 ). 

Since emission in the 21 cm hyperfine transition 
can be safely assumed to be proportional to the cos- 
mic density of neutral hydrogen, measurement of in- 
tensity fluctuations in the cumulative 21 cm emission 
appears as a very promising technique to trace large 
scale structure (LSS) over a wide range of redshifts. 
In this manner, BAO signatures may be detected with 
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the structure of interest is certainly above the non-linear 
scale and the 21 cm radiation is sensitive to neutral hy- 
drogen regardless of whether it is part of a resolved ob- 
ject, fine angular resolution is not necessary for such ex- 
periments. Thus the blind 21 cm surveys could be more 
efficient than traditional galaxy surveys at high redshifts 
where individual sources are hard to be resolved in suf- 
ficient numbers. Moreover, the 21 cm experiments have 
the advantage of simultaneously exploring a large red- 
shift range, benefiting from the broad frequency band- 
width allowed by radio telescopes. Furthermore, the 21 
cm emission line can provide a direct and precise mea- 
sure of the redshift, which is unlikely to be true in galaxy 
surveys. A considerable survey volume is necessary for 
21 cm measurements to achieve the precision required to 
detect large-scale features. Cylindrical radio telescopes 
with a wide field of view have been proposed to per- 
form such surveys in the future dPeterson et al. 1 120061; 
iSeo et al. I l20ld lAnsari et aT~l 1201 ID . and near term 
measurements using existing telescopes are discussed by 
iMasui et al.l pOlOT ). 

However, all of the most important LSS surveys have 
been taken with optical but not radio telescopes. BAOs 
are weak features imprinted in the 21 cm cosmological 
signal which has a strength of ~ 10 fiK near z ~ 1. In 
addition to the extremely faint signals themselves, fore- 
ground contamination seems formidable in low-frequency 
experiments. The Galactic and extragalactic foreground 
sources exceed the co smological signa l by a t least four 
order s of magnitude (TBowman et al. 1 120091; iLiu et al. I 
2009; IChang et al.l 120101: IWang' et al. I I2010D . Radio 
techniques have so far yielded direct detections of the 21 
cm emission only to z = 0.24 (|Lah et al. II2007D . indicat- 
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Fig. 1. — Brightness temperature images of foregrounds simulated at v = 710 MHz. The angular resolutions are nearly 0.6 arcminutcs 
(left panel) and 5 arcminutes (right panel). 



ing a significant uncertainty in future neutral hydrogen 
surveys. The ability to constrain the nature of dark en- 
ergy represents the ability to detect BAO wiggles in the 
21 cm power spectrum. Keys to these detections lie not 
only in experimental strategies but also in data process- 
ing techniques. In this paper, we discuss the detectability 
of BAOs in future hydrogen surveys around z = 1. Sim- 
ulations of the 21 cm interferometric measurements are 
described, and a probable method of foreground subtrac- 
tion is also considered. We present predictions for pos- 
sible systematic and statistical errors in the BAO scale 
estimates over an important parameter range, showing 
the range of behaviors we might expect in future observa- 
tions. We note that the foreground strategy in the post- 
reionization era is very different from that in the epoch 
of reionization (EoR). At high redshifts [z > 6), most of 
the neutral hydrogen is distributed in intergalactic space, 
nearly separated from galaxies. Hence the extragalactic 
point sources can be treated as just foreground contam- 
inants. However, the neutral hydrogen is locked in the 
overdense regions such as gas-rich galaxies at medium 
redshifts, indicating that the redshiftcd 21 cm line will 
be emitted from the extragalactic point sources, like the 
continuum. Galaxies therefore play two completely dif- 
ferent roles here: foreground sources as well as cosmolog- 
ical signal emitters. When we consider the foreground 
subtraction technique at z ~ 1, it must be kept in mind 
that only the continuum of galaxies should be cleaned 
out. Efficient operations should not remove foregrounds 
at the sacrifice of desired signals. We can neither sim- 
ply discard the contaminate d pixels in sky maps as sug- 
geste d in CMB observations (|Tegmark fc Oliveira-Costal 
1998) nor remove point sources with the prior sky mod- 



els as discussed in reionization experiments (jDatta et al. I 
l2010f ). Recently, analyses of radio observations at higher 
frequencies (e.g., 610 MHz) have provided some possi- 
ble ways to separate the post-reioni zation 21 cm signal 
from astrophy sical foregrounds (e.g. IChang et al. 1120101 ; 
IGhosh et al. M2011alTbh . And our work can be used as a 
starting point for modeling BAO detections in blind 21 
cm surveys up to z ~ 5. 

This paper is organized as follows. In Section 2, we in- 
troduce the proper simulations of the foreground model 
as well as the radio interferometric measurements. Our 
foreground removal technique is also described. Next, 
we apply the subtraction method to the simulated data 
cube in Section 3, and further estimate the sensitivities 
on measurements of the 21 cm power spectrum. We an- 
alyze results from a series of simulations in order to un- 
derstand how the measurement uncertainties depend on 
the data reduction and instrumental configurations. Fi- 
nally, we summarize our main conclusions in Section 4. 
Throughout the paper we adopt a concordance cosmol- 
ogy of Q = 0.265, n A = 0.735, ft & = 0.044, h = 0.71, 
n s = 1 and cj 8 = 0. 772, as revealed by t he WMAP three- 
year observations (|Spergel et al. |[2007l) . 

2. METHOD 

2.1. simulation of foregrounds 

We emplo y Mon te- Carlo simulations presented by 
IWang et al. I ([2010) and references therein to generate 
the brightness temperature sky maps at relevant frequen- 
cies, which incorporate contributions from three main 
components: (l)Galactic synchrotron and free-free emis- 
sion; (2)galaxy clusters; and (3)extragalactic discrete 
sources such as star-forming galaxies and AGNs. Ran- 
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Fig. 2. — Arrangement of antenna elements in the radio interferometer (left) and the corresponding density of visibility measurements in 
the uv plane at 710 MHz with integration time of 24 hours (right). All the 487 dishes are involved, distributed in 7 circles with diameters 
of D = 2, 6, 14, 30, 60, 100, 150 m. 



dom variations of morphological and spectroscopic pa- 
rameters as well as the evolution of radio halos in galaxy 
clusters are also considered. The out line o f these fore- 
ground simulations has been given in iMaol (l20ll) . and 
more details can be found in the original papers. 

All the simulations are performed on 1024 x 1024 grids 
over a field-of-view of 10° x 10°. Our fo reground box 
is kindly provided by iWang et al. I (|2010T ) . Since most 
BAO experiments are designed to provide low angular 
resolutions, we further arrange the sky maps onto grids 
of 120 x 120 pixels, corresponding to the angular resolu- 
tion of 5 arcminutes. Along the line of sight, we use the 
frequency range extending from 500 MHz to 900 MHz 
with a resolution of 1 MHz. In Figure 1, the simulated 
foreground maps with high and low angular resolutions 
are shown separately. 

2.2. simulation of radio interferometric measurements 

To calculate the sensitivity of a BAO measurement, we 
need to specify both the instrumental configurations and 
the observing strategy. Many factors contribute to the 
instrumental response of a radio interferometer. Among 
the most important to consider are the field of view, 
angular resoluti on, collecting area, ant enna distribution, 
and bandwidth (|Bowman et al. 11200(1 . 

In this work, the smallest scale we consider is set to 
be the third BAO peak whi ch locates at ~ 0.5 degree 
angular scale around z = 1 (jAnsari et al. I f20TTh . And 
thus a radio instrument of modest size, typically with 
diameter D ~ 100 m, can be used to resolve the BAO 
features. Moreover, BAO surveys should be performed 
in various pointing directions in order to observe a large 
volume of the universe which is crucial to obtain small 
statistical errors on measurements of the LSS power spec- 
trum. Hence the characteristic observing time scales with 
the collecting area, the instantaneous field of view (i.e., 
FOV) and the number of pointings. Obviously, a large 
FOV ~ (A/Ddish) 2 is important to raise the survey effi- 
ciency. Here, we introduce an interferometer consisting 
of iVdish = 487 dishes with diameter Ddish = 1 m. All the 
dishes are nearly coplanar and organized in circles with 
different diameters of D = 2, 6, 14, 30, 60, 100, 150 m. For 
D < 30 m, the dishes are spaced roughly 1 m from each 



TABLE 1 
Experimental Specifications 



Parameter Value 



fiHi& 0.0005 

Array diameter, D (m) 100 

Number of dishes, iVdjsi, 330 

System temperature, T sys (K) 50 

Bandwidth, B (MHz) 100 

Central frequency, vq (MHz) 710 

Spectral resolution, Au (MHz) 1 

Order of polynomial fit, N 3 

Sky coverage, / sky 0.5 



Note. — Parameters are listed for the fiducial model. 
The central frequency corresponds to z = 1 in 21 cm mea- 
surements. 



other, otherwise, the dishes are spaced roughly 3 m from 
each other. In the left panel of Figure 2, we show the 
array layout for the 487 dishes with a maximum base- 
line of 150 m. The corresponding uv sampling density 
at v = 710 MHz is plotted in the right panel. One may 
note that, when the dishes are limited in quantity, the 
dense-to-sparse arrangement could promise complete uv 
coverage in the central area, but also provide some long 
baselines for good angular resolutions. 

How shall we carry out the large- volume survey? Gen- 
erally speaking, a larger sky coverage requires a longer 
observation time. However, if we could observe several 
independent fields of view simultaneously, the total sur- 
vey volume will be extended without prolonging the ob- 
serving time. Here we consider a 21 cm survey with 10 
separate interferometers, each situated at a different lo- 
cation and observing a different patch of the sky. Com- 
pared to a single interferometer, the cost of building the 
proposed telescope will no doubt increase. And we need 
to find a balance between price and efficiency. For com- 
putational simplicity, our reference measurement is cho- 
sen to be a deep observation of a single target field cen- 
tered on the North Celestial Pole (NCP). The effect of 
the earth rotation synthesis is not taken into account in 
the reference observation. We assume that observations 
pointing in different directions can be performed in the 
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Fig. 3. — The top row shows the dirty map (left panel) and the corresponding uv map (right panel) observed at u = 710 MHz under the 
fiducial model, while the bottom row shows the frequency spectra in a random image pixel (left panel) and uv pixel (right panel) across 
the frequency range 660 < u < 760 MHz. 



same way as our reference measurement. 

In order to understand how the instrumental param- 
eters impact the future BAO measurements, our sim- 
ulations are passed thro ugh the observational pipeline 
described in lMao I (|2012l ). In what follows, we briefly in- 
troduce its scheme. Since the 21 cm signal is very faint 
compared to foregrounds and its low-frequency proper- 
ties are highly uncertain, we do not include it in our sky 
model. Besides, the effect of foreground removal on the 
cosmological signal is outside the scope of this work and 
hence ignored. The original image cube is first estab- 
lished on the basis of the brightness temperature distri- 
bution of foreground sources. Also, the primary beam is 
simply assumed to be unity within the FOV. At each fre- 
quency channel, the sky image is related to visibilities via 
the two dimensional Fourier transform. Subsequently, we 
estimate the noise visibilities with one year integration. 
The rms noise per visibility per frequency channel can 
be w ritten as (R ohlfs fc Wilson 1 120041 IMcQuinn et al. I 
12006ft 



in which A e and fib are the effective area and the beam 
solid angle of a single dish respectively, Ai/ is the band- 
width of a single frequency channel, and t is the total 
integration time for sampling a given (u, v) location. For 
the reference observation, we assume the system temper- 
ature to be T sys — 50 K according to current technology. 
In addition, we approximate the integration time t = 
tN(u, v), where r = 5 s is the accumulation duration for 
each visibility measurement, and N( u,v) is the number 
of independent samples in t hat pixel (Bharad wai fc Ali I 
120051 : iBowman et al. Il2009f ). Because the thermal noise 
is random, we draw complex visibilities from Gaussian 
distributions with zero mean and rms described above. 
Finally, the real-world sampling is carefully performed 
in the uv plane. The contribution of any single visibility 
measurement is applied to only one grid cell. We further 
normalize the baseline distribution to ensure that each 
pixel in the sampled part of the uv plane has the same 
weight. With the uniform weighting, we artificially em- 
phasize the information contained in long baselines and 
increase the effective resolution of the derived sky maps. 
Following all these simulations, we generate our image 
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Fig. 4. — Foreground subtraction technique applied to the simulated interferometric measurements. The top row gives two cuts for the 
input visibility cube, showing a uv map at a single frequency channel v = 710 MHz (left panel) and a slice along the frequency direction 
and w-axis (right panel). The bottom row gives the same cuts, but for the cleaned visibility cube following foreground subtraction. 



cube and visibility cube representing realistic observa- 
tions. Figure 3 shows the dirty map and the correspond- 
ing uv map at v = 710 MHz. Additionally, the frequency 
spectra in a random image and uv pixel are also plotted 
separately in the bottom row. As illustrated in these 
two panels, the frequency coherence of foregrounds will 
be des troyed in real space owing to the "mo de-mixing" 
effect (jBowman et al. |[200l ILiu et al. Il2009l ). 

2.3. foreground subtraction 

It is generally agreed that the intergalactic medium 
(IGM) contains the majority of baryons in the Uni- 
verse, and is completely ionized by UV radia tion beyond 
z = 6 (jBecker et al. Il200lt iFan et al. Il200l . Neverthe- 
less, the damped Lyman series absorption seen in quasar 
optical spectra indicates that a significant fraction by 
mass of neutral hydrogen do es reside in the interstellar 
medium (ISM) of galaxie s (jStorrie-Lombardi fc Wolfe"! 
120001: IP eroux et al. ibool . making it possible to use the 
neutral hydrogen as a tracer of the underlying large-scale 
structure in the post-reionization epoch. As we men- 
tioned above, most of the neutral hydrogen is outside 
and inside of galaxies before and after reionization re- 
spectively, indicating that new developments in methods 
of isolating the 21 cm signal from foregrounds will be 
required in post-reionization experiments. In such ob- 
servations, emissions of galaxies consist of two compo- 
nents: the radio continuum as well as the redshifted 21 
cm line. The former is relatively featureless, however, the 
latter is expected to fluctuate rapidly in the frequency 
and angular directions. Both these radiations are emit- 
ted from extragalactic point sources, but have fundamen- 



tally different qualities. In addition, the instrumental 
response changes with observing frequency, and thereby 
creates different sidelobe patterns across the measured 
sky maps, inducing the "mode-mixing" effect which will 
destroy the foreground smoothness in frequency space. 
The frequency decoherence has been shown clearly in the 
left hand column of Figure 3. 

Over the last decade, much effort has been made 
in exploring possible methodolo gies f o r foreground 
subtraction (e.g. iMatteo et al I 120021: lOh fc Mark ' 



2003; 
2004 : 



iZaldarriaea et al 



ISantos et al. 



.20041 : iFurlanetto fc Briggs 
20051: IWang et al. I 1200ft 



Morales et al. 11200ft iGleseret al. 1120 08 f Bo wman et al 



20091: ILiu et al. I 120091 1b!: iHarker et al 



2011 



2010; ILiu fc Tegmark I JoTl 



iGhosh et al. 



iPetrovic fc Oh I 
l2011all bTr~ lAnsari et al. I 120111 : 

Chapman et al. I I2012T ). The most widely discussed 
technique concentrates on the spectral fitting along 
each line-of-sight (LOS), dealing with the removal of 
diffuse emission and unresolved point sources under the 
assumption that bright point sources have been cleaned 
out with high accuracy and precision. Owing to the 
"mode-mixing" effect, excision of point s ources is always 
first carried out in a clean ing p r ocess ([Morales et al. I 
2006t iDattaet al. I [20091 120101: iPindor et al. I 120111: 
Bernardi et al. 1 120111) However, for the proposed neu- 
tral hydrogen surveys, the extragalactic point sources 
should be treated not only as foreground contami- 
nants but also as signal emitters. This means that 
the point sources shou l d not be removed, except for 
their continuum. iMao I (|2012l ) introduced a foreground 
removal technique in Fourier space, which handles 
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Fig. 5. — Angular power spectra and measurement variance at 
z = 1 for the fiducial model. Top panel: the dash-dotted line 
and dotted line correspond to the input foregrounds and detector 
noise, respectively. The solid line represents the expected 21 cm 
signal, and the dashed line shows the post-subtraction noise in the 
cleaned sky map, including residual foregrounds and detector noise. 
The vertical lines indicate locations of the first three BAO peaks. 
Bottom panel: variance in the estimate of 21 cm power spectrum 
for an integration time of 1 year. 



the confusion-level contaminants and the bright point 
sources as equivalent. They fit the visibility spectrum 
using a weighted low-order polynomial along the LOS. 
And any special treatments to point sources have 
been sidestepped. In this method, the inverse-variance 
weighting scheme protects the frequency coherence of 
foregrounds, and most importantly, the polynomial 
function guarantees that only the spectrally smooth 
component of the point sources' radiation will be 
subtracted. One can therefore hope that most of the 
21 cm signal emitted from the point sources will be 
preserved well in the polynomial-subtracted residuals. 
Furthermore, a key feature of this cleaning method is 
that the algorithm is blind, indicating that it is not 
necessary to resolve point sources in future observations. 
One can thus get a good start because the low-frequency 
BAO surveys will be performed with limited resolutions. 
Here, this formalism is employed to remove the radio 
foreground contamination. Refer to the original paper 
for more details of the cleaning method. 

3. RESULTS 

In this work, we aim to study the feasibility of de- 
tecting BAOs in the 21 cm intensity fluctuations after 
rcionization, using a radio interferometric array. The 
measurement uncertainties depend on many parameters, 
and we will mainly consider the impacts of telescope 
design, observation strategy and foreground subtraction 
techniques. To get a sense of the precision which we are 



able to achieve, we first describe a fiducial observation 
in Section 3.1, and further quantify its sensitivity in Sec- 
tion 3.2. A detailed account of the extent to which the 
various parameters matter is represented in Section 3.3. 

3.1. fiducial model 

Table 1 lists the parameter values adopted in our fidu- 
cial model. Most of these parameters reflect the cur- 
rent technology, and some specifications may be subject 
to change because of the developing experiments. Un- 
der the fiducial model, we perform the simulations de- 
scribed in §2.2, and then use the proposed foreground 
cleaning method on the visibility data cube. Figure 4 
illustrates the ability to remove the foreground contam- 
ination with instrumental response. The top row of the 
figure shows the simulated visibility cube. The left panel 
is a map from a single frequency channel, and the right 
panel plots & v-v plane slicing along the frequency direc- 
tion. Moreover, the bottom row shows the same cuts, but 
in the residual data cube following the polynomial sub- 
traction. Qualitatively, the residual visibilities have been 
suppressed to a level of order ~ 10 ^K. This represents 
the fiducial scenario that can be achieved by our fore- 
ground cleaning method to the simulated interferometric 
measurements. In principle, the amplitude of the residu- 
als could be made arbitrarily low by increasing the order 
of the polynomial used in the spectral fitting. However, 
the process of foreground subtraction may accidently re- 
move the spectrally smooth component of the cosmolog- 
ical signal, and hence lead to suppression in the final 
estimate of power spectrum primarily on large scales. 
These artifacts have b e en discussed in previous work (e.g . 
IBowman et al~l [20091 : IHarker et al~l |20ToT; iMaol [20121) . 
We should thereby perform the polynomial fit with or- 
der as low as possible. 

3.2. sensitivity estimate 

After foreground subtraction, we are left with a resid- 
ual data cube containing the fitting errors and the ther- 
mal noise. With this data set in hand, we can compute 
the analytic approximations of measurement uncertain- 
ties in a simple way that is of ten accurate enough for 
studying the experiment design. IKnox I ([1995D has shown 
that the noise in a pixelized map could be modeled as a 
random field with an angular power spectrum given by 

Cf = v-ieW+V = o&Sl&eW+V, (2) 

where we take the pixel solid angle to be Q p i x = #f w hm x 
0fwhmj arid 9b is the standard deviation for a Gaussian 
beam function in the experiment. The most straightfor- 
ward way of estimating the rms error in the map is to 
specify cr pix . For the observation we simulate here, cr p i X 
is defined as the standard deviation of the temperature 
fluctuations in the residual sky map, which has contri- 
butions from the residual foregrounds and other system- 
atic effects. This approximation only strictly holds for 
a Gaussian distribution, and the foreground residuals in 
the sky map may not be a Gaussian field, but will be spa- 
tially correlated. However, we ignore these and use this 
relationship here since it allows considerable simplifica- 
tion. The estimate of the 21 cm power spectrum could 
therefore be described as a normal distribution with a 
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Fig. 6. — Effect of the cosmic density of neutral hydrogen together with the bias factor: sensitivity estimates for the fiducial model but 
with various values of f^mfe- The top row as well as the bottom left panel give the angular power spectra as defined in the top panel of Figure 
5. The bottom right panel shows the possible variances for Chi^ = 0.0001 (dashed line), Ouib = 0.0005 (solid line), and Ouib = 0.003 
(dash-dotted line). 



variance (|Knox 1119951) 



ACV 



(2£+l)f sky 



[Ct + Cj, 



(3) 



Thus the simple estimate of the accuracy with which the 
expected power spectrum could be measured directly de- 
pends on the sampling variance, cosmological signal and 
pixel noise. In our simulations, we never create the 21 
cm signal. Instead we base our estimate of the 21 cm 
power spectrum on an analytical approach 



P^{z, k) = T^P m {z, k) = T 2 b 2 P S5 (z, k) 



(4) 



in which Pss(z, k) and b represent the matter power spec- 
trum and bias factor respectively. The mean brightness 
temperature due to the 21 cm line can be written as 
(jChang et al. \&M ISeo et al. ll2010h 



T = 0.3mK 



HI 



10- 



n m (l + z) 3 + Q A 
0.29(1 + z) 3 



-1 -1/2 


"1+z" 




2.5 



1/2 

(5) 



Also, the angular power spectrum Ce is derived using the 
standard Limber's equation under the flat-sky approxi- 



mation (|Kaiserlll992l) . 

In Figure 5, we show Cg, Cf and ACi for the fidu- 
cial model. The dashed line in the top panel represents 
the angular power spectrum of the intensity fluctuations 
in the foreground subtracted sky map, compared to the 
input foregrounds (dash-dotted line), detector noise (dot- 
ted line) and the expected 21 cm signal (solid line). The 
vertical lines indicate the locations of the first three BAO 
peaks. One can see that the huge contaminants will be ef- 
fectively removed with obviously smaller residuals which 
are less than one part in 10 5 of original model. Therefore 
the first two peaks of BAO wiggles could be measured 
with required precision. However, for I > 1/(9;, the noise 
level increases rapidly. To a certain extent, this is in- 
duced by the fast growth of the thermal noise at small 
scales. In the bottom panel, the variance ACg changes 
with £: it decreases at the first and then increases. Its 
minimum value appears at £ ~ 500. From Equation 3, it 
is evident that the sampling variance as well as the cos- 
mological signal and pixel noise will together determine 
the quantity of ACg. Their effects at different scales will 
be studied in the next section. 
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Fig. 7. — Same as Figure 6, but for the effect of the array layout: sensitivity estimates for the fiducial model but with various arrangements 
of dishes. From the top left, top right to bottom left panel, the maximum baselines are -D ma x = 60, 100, 150 m respectively, and the 
corresponding total numbers of dishes are Afjish = 226, 330, 487. The bottom right panel shows the possible variances for D max = 60 m 
(dashed line), D ma x = 100 m (solid line), and Dmax = 150 m (dash-dotted line). 



3.3. exploration of parameter space 

Above we investigated errors in the estimates of BAO 
scales under our fiducial model, including the effects 
of foreground contamination and instrumental response. 
Now we use a set of simulations with various parameters 
to investigate how well we could expect to measure the 
acoustic scales in future 21 cm surveys. In each case, we 
examine only one of these parameters and keep all others 
fixed. 

3.3.1. theoretical parameters 

Observations of quasar absorption spectra indicate 
that the intergalactic medium is highly ionized and a 
significant amount of neutral hydrogen could be found 
only in galaxies in the post-reionization universe. In 
this work, there is no need to describe the HI distri- 
bution, but rather we make a quantitative estimate of 
the total neutral hydrogen content and study its evolu- 
tion at moderate redshifts. Our knowledge of the neu- 
tral hydrogen content is derived mainly from gas ab- 
sorption in the HI hyperfine transition. These obser- 
vations find that the comoving HI density in galaxies 
is almost constant with a density parameter of = 



0.001 beyond z = 1, only doubling over the redshift 
rang e z ss 2-4 (IPeroux et al. I [20051: 1 Wolfe et al. 1120051: 
iProchaska &: Wolfe I |2000Y Moreover, the proposed HI 
survey traces galaxies but not the underlying matter dis- 
tribution, which over-weights the overdense regions and 
thus introduces an inherent bias. Since this bias factor 
is as yet fairly unconstrained observationally, we ignore 
its time and scale dependences and a s sume a constant 
value for it (e.g. iChang et al. 1 120081: iWvithe fc Loeb I 
I2009t ISeo et al. 112010 ). Thislrould be accurate enough 
for our purposes in the present work. Based on these, 
we use the halo model to calculate the mass power spec- 
trum, and adopt n m b = 0.0001, 0.0005, 0.003 for the pes- 
simistic, fiducial and optimistic predictions in the theo- 
retical model respectively. Figure 6 investigates these 
three scenarios. It appears to be true that the BAO 
measurements would benefit from increasing the inten- 
sity of 21 cm emission which is proportional to the cos- 
mic density of neutral hydrogen in the universe. And in 
the bottom-right panel, the variances of power spectrum 
estimates increase with the amount of neutral hydrogen 
especially on large scales. 
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Fig. 8. — Same as Figure 6, but for the effect of the covered sky area: sensitivity estimates for the fiducial model but with / s k y =0.1, 0.5, 1.0. 
The bottom right panel shows the possible variances for f s ^ y = 0.1 (dashed line), / s k y = 0.5 (solid line), and f s ^ y = 1 (dash-dotted line). 



3.3.2. instrumental parameters 

In a radio interferometric experiment, different distri- 
butions of dishes give different instrumental responses. 
We consider three cases here, in which dishes within 
diameters of 60, 100, 150 m are involved respectively 
and the total numbers of dishes are 226, 330, 487 cor- 
respondingly Results are shown in Figure 7. In the 
top panels and the bottom-left panel, it is clear that the 
foreground subtraction can be done well using all these 
arrays, and there is no significant difference in perfor- 
mance at I < 300. Nevertheless, one may note that the 
noise in the residual map (dashed lines) grows rapidly at 
large I due to the exponential factor in the noise term. 
Meanwhile, the thermal noise (dotted lines) will increase 
faster for lower angular resolutions. In the bottom-right 
panel, for small values of £, AC; is limited by the sample 
variance together with the cosmological signal, and thus 
decreasing the beam size 6b decreases ACi only slightly. 
But for I > l/0b, the reduction in ACe is dramatic, 
greatly expanding the available range of scales. For this 
reason, it is suggested that a BAO experiment should 
make measurements at resolution < 10 arcminutes in or- 
der to resolve the higher harmonics of BAO features in 
the extracted power spectrum. Moreover, the smaller 



beam experiment will provide information over a wider 
range of modes. 

Now we focus on the effect of increasing the field of 
view. The observed fraction of the sky is defined by / s ky • 
In Figure 8, we compare the possible errors in BAO mea- 
surements with / s ky = 0.1,0.5,1. Comparing the top 
panels to the bottom-left one, we do not find that the 
residual noise level (dashed lines) has reduced obviously. 
The most significant effect appears in the bottom-right 
panel, where the variance ACe seems to be proportional 
to the factor 1 / / s k y . The errors expected from the survey 
of a larger covering area appear smaller at most scales. 
It indicates that an experiment with poorer sky coverage 
will be less precise at every value of £, since the increase 
in the number of available modes decrease the sample 
variance errors. The figure also makes it clear that we 
are unable to extract meaningful information with our 
current methods until / s k y is large enough. For the BAO 
detections, it would be important to cover multiple sky 
areas in a fixed observing time, and hence reduce the 
large-scale errors which are dominated by sample vari- 
ance. 

3.3.3. algorithmic parameters 
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Fig. 9. — Same as Figure 6, but for the effect of the frequency resolution: sensitivity estimates for the fiducial model but with the 
foreground subtraction performed by fitting three-order polynomials to the visibility spectra with various frequency resolutions. In the 
bottom right panel, the possible variances are plotted for Ai/ = 1 MHz (solid line), Aiv = 2 MHz (dashed line), and Aiv = 3 MHz 
(dash-dotted line). 



In this section, we examine three parameters relating 
to the foreground subtraction algorithm: frequency res- 
olution, bandwidth of polynomial fit and order of the 
fitting polynomial. The results are plotted in Figure 9, 
Figure 10 and Figure 11 respectively. 

From Figure 9, one can see that the lower frequency 
resolutions will make little difference in the measured 
power spectrum, indicating that the measurement un- 
certainties are not sensitive to the frequency resol ution. 
This coincides with the conclusion presented in iMao I 
(2012) which illustrated that the proposed foreground 
cleaning method would provide the residual data with 
similar quality even performed with various frequency 
resolutions. 

In Figure 10, we investigate the effect of changing the 
frequency range over which the foreground spectra are 
fitted with smooth functions. At first sight, a smaller 
frequency bandwidth seems favorable for the BAO de- 
tections. After foreground cleaning, the overall level of 
noise (dashed lines in the top panels and the bottom-left 
panel) would increase with the bandwidth, and hence en- 
large the variance AC? especially at I > 300. However, 
we caution that the fewer frequency channels will de- 



grade the signal-to-noise ratio in each band, furthermore, 
the cosmological signal on smaller scales may be acciden- 
tally removed during foreground subtraction. One should 
therefore approximate the frequency spectrum of fore- 
ground contaminants over a frequency range that is as 
broad as possible, with the constraint that the produced 
residuals lie below the expected level of cosmic signal. 
For real observations, a choice of the frequency range 
would be made in the context of a specific experiment. 

Compared to the two parameters we mentioned above, 
the order of the polynomial fit will influence the results to 
the maximum extent, which is clearly shown in Figure 11. 
As we see in the top-left panel, there are insufficient de- 
grees of freedom to remove radio foregrounds efficiently, 
and thus the rms noise (dashed line) in the residual maps 
may exceed the expected cosmological signal (solid line) , 
inducing the destruction of measured BAO signatures. 
And obviously, the residual noise (dashed lines in the 
top-right and bottom-left panels) could be reduced sig- 
nificantly by increasing the order of polynomial used in 
foreground fits. Similarly, the bottom-right panel repre- 
sents dramatic reductions in the detection variance. Nev- 
ertheless, as emphasized by previous work, some of the 
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Fig. 10. — Same as Figure 6, but for the effect of the bandwidth: sensitivity estimates for the fiducial model but with the foreground 
subtraction performed by fitting three-order polynomials to the visibility spectra over various frequency ranges. In the bottom right panel, 
the possible variances are plotted for B = 50 MHz (dashed line), B = 100 MHz (solid line), and B = 200 MHz (dash-dotted line). 



cosmological signal may be mistaken if N is too high (e.g. 
iFurlanetto et al. 1120061 iLiu et al. 1120090 . In this work, 
we obtain believable measurements of the first two BAO 
peaks using a third-order polynomial to fit the real and 
imaginary parts of the visibility spectrum separately at 
each point in the uv plane. 

4. DISCUSSION 

Recently, it was realized that the redshifted 21 cm line 
emitted from neutral hydrogen can be used as a tracer 
of the large-scale structure even after reionization. Also, 
detection of BAOs on the inferred 21 cm power spectrum 
can be used to explore the geometry of the Universe es- 
pecially in the dark energy dominated regime. In this 
paper, we quantified the likelihood of measuring BAO 
features near z ~ 1 with a set of ground-based radio in- 
terferometers. Simulations of the 21 cm interferometric 
measurements have been described, taking into account 
the detector noise, the foreground contamination and the 
frequency-dependent instrumental response. Different 
from galaxy redshift surveys in optical and radio wave- 
bands, we developed a possible observing strategy with- 
out individual object detections. The foreground clean- 
ing technology has also been studied, which is very dif- 



ferent from both the CMB and EoR experiments. More- 
over, we investigated what the accuracy of BAO measure- 
ments is likely to be available in future 21 cm surveys, 
and further discussed how the measurement uncertain- 
ties would change if different assumptions are made for 
key configurations of the radio telescope as well as some 
potential parameters used in the foreground subtraction 
method. The blind surveys of the redshifted 21 cm emis- 
sion line have proven useful in offering information for 
acoustic scales at moderate redshifts, providing an im- 
portant complement to the existing galaxy surveys. 

In the fiducial scenario, we considered a pixel-by-pixel 
method to subtract continuum foregrounds from simu- 
lated visibilities, and found that a three-order polynomial 
can be used to well approximate the smooth spectrum. 
And in the cleaned sky maps, the detector noise together 
with the residual foregrounds will contribute to the pixel 
noise. Just as we saw in Figure 5, this noise term (dashed 
line) could be suppressed to a level of order ~ 10 fiK at 
I < 500 after a whole year integration. The imprints of 
the first two BAO peaks are therefore possible to be re- 
solved on the 21 cm angular power spectrum. Although 
BAO experiments may not require fine resolutions, we 
found that an angular resolution of ~ 10 arcminutes is 
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Fig. 11. — Same as Figure 6, but for the effect of the order of the polynomial fit: sensitivity estimates for the fiducial model but with the 
foreground subtraction performed by fitting polynomials of various degrees to the visibility spectra. In the bottom right panel, the possible 
variances are plotted for N = 1 (dashed line), N = 3 (solid line), and N = 5 (dash-dotted line). 



necessary in order to identify the higher harmonics of the 
BAO feature on the extracted power spectrum. This is 
because the residual noise would increase exponentially 
with the angular resolution, preventing significant detec- 
tions on large values of £. We note that, in the fiducial 
scenario, all the antennas point toward the NCP and 
continuously observe a fixed sky patch. Therefore, the 
integration time can be approximated by the observing 
time. However, the total observing time should be longer 
if we observe a different part of the sky, depending not 
only on the location of the interferometer but also on the 
sky coordinates of the patch being observed. Making a 
simplifying assumption that a single field of view can be 
observed for 6-8 hours in each d ay (|Datta et al. 1120091 
2010; Sar kar fe Bharadwai II201U) . the required integra- 
tion time (1 year) would be possibly achieved over a pe- 
riod of 3-4 years. In addition, the precision of BAO mea- 
surements is proportional to the survey volume. Fortu- 
nately, next generation radio telescopes will benefit from 
the rapid rate of increase of field-of-view and collecting 
area. Then the survey speed can be greatly accelerated, 
which is likely to be orders of magnitude higher than that 
achievable by optical spectrographs. Generally speaking, 
each radio interferometer should produce an instanta- 



neous FOV of ~ 100 deg 2 , and possess a collection area 
on the order of ~ 1000 m 2 . Moreover, several tens of 
separate pointings could be used to map out a signifi- 
cant fraction {e.g. ~ 1/2) of the sky within a limited 
observation time. 

We investigated here the role of blind 21 cm surveys 
in BAO measurements. Our results suggest that radio 
interferometers have the potential to become very com- 
petitive facilities for performing cosmologically interest- 
ing tests. While it is a challenging experiment to achieve 
constraints on the order of 1% in the BAO measurements, 
these calculations expand our understanding of the ef- 
fects of instrument design, observing strategy and data 
analysis techniques, offering a new angle for the future 
generation of neutral hydrogen surveys. 
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